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Abstract 



We consider the optimal packet scheduling problem in a single-user energy harvesting wireless 
communication system. In this system, both the data packets and the harvested energy are modeled to 
arrive at the source node randomly. Our goal is to adaptively change the transmission rate according to 
the traffic load and available energy, such that the time by which all packets are delivered is minimized. 
Under a deterministic system setting, we assume that the energy harvesting times and harvested energy 
amounts are known before the transmission starts. For the data traffic arrivals, we consider two different 
scenarios. In the first scenario, we assume that all bits have arrived and are ready at the transmitter 
before the transmission starts. In the second scenario, we consider the case where packets arrive during 
the transmissions, with known arrival times and sizes. We develop optimal off-line scheduling policies 
which minimize the time by which all packets are delivered to the destination, under causality constraints 
on both data and energy arrivals. 
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I. Introduction 

We consider wireless communication networks where nodes are able to harvest energy from 
nature. The nodes may harvest energy through solar cells, vibration absorption devices, water 
mills, thermoelectric generators, microbial fuel cells, etc. In this work, we do not focus on how 
energy is harvested, instead, we focus on developing transmission methods that take into account 
the randomness both in the arrivals of the data packets as well as in the arrivals of the harvested 
energy. As shown in Fig. \T\ the transmitter node has two queues. The data queue stores the data 
arrivals, while the energy queue stores the energy harvested from the environment. In general, the 
data arrivals and the harvested energy can be represented as two independent random processes. 
Then, the optimal scheduling policy becomes that of adaptively changing the transmission rate 
and power according to the instantaneous data and energy queue lengths. 
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Fig. 1. An energy harvesting communication system model. 

While one ideally should study the case where both data packets and energy arrive randomly in 
time as two stochastic processes, and devise an on-line algorithm that updates the instantaneous 
transmission rate and power in real-time as functions of the current data and energy queue 
lengths, this, for now, is an intractable mathematical problem. Instead, in order to have progress 
in this difficult problem, we consider an idealized version of the problem, where we assume 
that we know exactly when and in what amounts the data packets and energy will arrive, and 
develop an optimal off-line algorithm. We leave the development of the corresponding on-line 
algorithm for future work. 

Specifically, we consider a single node shown in Fig. [2] We assume that packets arrive at times 
marked with x and energy arrives (is harvested) at points in time marked with o. In Fig. [21 Bi 
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Fig. 2. System model with random packet and energy arrivals. Data packets arrive at points denoted by x and energies arrive 
(are harvested) at points denoted by o. 

denotes the number of bits in the ith arriving data packet, and Ei denotes the amount of energy 
in the ith energy arrival (energy harvesting). Our goal then is to develop methods of transmission 
to minimize the time, T, by which all of the data packets are delivered to the destination. The 
most challenging aspect of our optimization problem is the causality constraints introduced by 
the packet and energy arrival times, i.e., a packet may not be delivered before it has arrived and 
energy may not be used before it is harvested. 

The trade-off relationship between delay and energy has been well investigated in traditional 
battery powered (unrechargeable) systems. References Q]|-[[6l investigate energy minimization 
problems with various deadline constraints. Reference [Q]| considers the problem of minimizing 
the energy in delivering all packets to the destination by a deadline. It develops a lazy scheduling 
algorithm, where the transmission times of all packets are equalized as much as possible, subject 
to the deadline and causality constraints, i.e., all packets must be delivered by the deadline 
and no packet may be transmitted before it has arrived. This algorithm also elongates the 
transmission time of each packet as much as possible, hence the name, lazy scheduling. Under 
a similar system setting, |2j proposes an interesting novel calculus approach to solve the energy 
minimization problem with individual deadlines for each packet. Reference develops dynamic 
programming formulations and determines optimality conditions for a situation where channel 
gain varies stochastically over time. Reference flU considers energy-efficient packet transmission 
with individual packet delay constraints over a fading channel, and develops a recursive algorithm 
to find an optimal off-line schedule. This optimal off-line scheduler equalizes the energy- 
rate derivative function as much as possible subject to the deadline and causality constraints. 
References O and [|6l extend the single-user problem to multi-user scenarios. Under a setting 
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similar to [QQ|, we investigate the average delay minimization problem with a given amount 
of energy, and develop iterative algorithms and analytical solutions under various data arrival 
assumptions in 0. References [[8||-[fT4| investigate delay optimal resource allocation problems 
under various different settings. References lf8l- lfT0l consider average power constrained delay 
minimization problem for a single-user system, while lfTTTl - [fT4ll minimize the average delay 
through rate allocation in a multiple access channel. 

In this paper, we consider a single-user communication channel with an energy harvesting 
transmitter. We assume that an initial amount of energy is available at t = 0. As time progresses, 
certain amounts of energies will be harvested. While energy arrivals should be modeled as a 
random process, for the mathematical tractability of the problem, in this paper, we assume that 
the energy harvesting procedure can be precisely predicted, i.e., that, at the beginning, we know 
exactly when and how much energy will be harvested. For the data arrivals, we consider two 
different scenarios. In the first scenario, we assume that packets have already arrived and are 
ready to be transmitted at the transmitter before the transmission starts. In the second scenario, we 
assume that packets arrive during the transmissions. However, as in the case of energy arrivals, 
we assume that we know exactly when and in what amounts data will arrive. Subject to the 
energy and data arrival constraints, our purpose is to minimize the time by which all packets 
are delivered to the destination through controlling the transmission rate and power. 

This is similar to the energy minimization problem in 0], where the objective is to minimize 
the energy consumption with a given deadline constraint. In this paper, minimizing the trans- 
mission completion time is akin to minimizing the deadline in (T). However, the problems are 
different, because, we do not know the exact amount of energy to be used in the transmissions, 
even though we know the times and amounts of harvested energy. This is because, intuitively, 
using more energy reduces the transmission time, however, using more energy entails waiting 
for energy arrivals, which increases the total transmission time. Therefore, minimizing the 
transmission completion time in the system requires a sophisticated utilization of the harvested 
energy. To that end, we develop an algorithm, which first obtains a good lower bound for the 
final total transmission duration at the beginning, and performs rate and power allocation based 
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on this lower bound. The procedure works progressively until all of the transmission rates and 
powers are determined. We prove that the transmission policy obtained through this algorithm 
is globally optimum. 

II. Scenario I: Packets Ready Before Transmission Starts 

We assume that there are a total of B bits available at the transmitter at time t — 0. We also 
assume that there is E amount of energy available at time t — 0, and at times si, s 2 , . . ., sk, 
we have energies harvested with amounts E\, E 2 , . . . , Ek, respectively. This system model is 
shown in Fig. |3j Our objective is to minimize the transmission completion time, T. 
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Fig. 3. System model with all bits available at the beginning. Energies arrive at points denoted by o. 

We assume that the transmitter can adaptively change its transmission power and rate according 
to the available energy and the remaining number of bits. We assume that the transmission rate 
and transmit power are related through a function, g(p), i.e., r = g(p). We assume that g(p) 
satisfies the following properties: i) g(0) = and g(p) — > oo as p — > oo, ii) g(p) increases 
monotonically in p, iii) g(p) is strictly concave in p, iv) g(p) is continuously differentiable, and v) 
g(p)/p decreases monotonically in p. Properties i)-iii) guarantee that g~ l (r) exists and is strictly 
convex. Property v) implies that for a fixed amount of energy, the number of bits that can be 
transmitted increases as the transmission duration increases. It can be verified that these properties 
are satisfied in many systems with realistic encoding/decoding schemes, such as optimal random 
coding in single-user additive white Gaussian noise channel, where g(p) = |log(l +p). 

Assuming the transmitter changes its transmission power N times before it finishes the 
transmission, let us denote the sequence of transmission powers as pi, p 2 , . . ., Pn, and the 
corresponding transmission durations of each rate as l x , l 2 , . . ., In, respectively; see Fig. HI 
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Fig. 4. The sequence of transmission powers and durations. 



Then, the energy consumed up to time t, denoted as E(t), and the total number of bits departed 
up to time t, denoted as B(t), can be related through the function g as follows: 



E{t) = ^Pik+Pi+i I * - 

i=l \ i=l 

B(t) = J2g(Pi)h + gipurx) (t-J2 l * 



(1) 

(2) 



i=l 



i=l 



where i = max{i : Y^)=i h — 0- 

Then, the transmission completion time minimization problem can be formulated as: 



mm 
p.i 

s.t. 



T 

E(t) < E t , 

i:Si<t 

B{T) = B 



< t < T 



(3) 



First, we determine the properties of the optimum solution in the following three lemmas. 

Lemma 1 Under the optimal solution, the transmit powers increase monotonically, i.e., p\ < 
p 2 < ■ ■ ■ < Pn- 

Proof: Assume that the powers do not increase monotonically, i.e., that we can find two powers 
such that pi > p i+1 . The total energy consumed over this duration is piU + Pi+ik+x. Let 

Pik + Pi+ik+i 



P'i = P'i+i 



r[ = r' i+1 = g 



h + h+i 
Pik + Pi+ik+i 
h + h+i 



(4) 
(5) 
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Then, we have p\ < p^ p' i+1 > Pi + \. Since p[li < pji, the energy constraint is still satisfied, and 
thus, the new energy allocation is feasible. We use r'^r' i+l to replace r^r^+i in the transmission 
policy, and keep the rest of the rates the same. Then, the total number of bits transmitted over 
the duration li + becomes 

rji + r i+1 l i+1 = g I — l + l — ) (*< + 

> 9 (Pi) r-A — (ii + k+i) + 9 (Pi+i) -rrj—(h + ku) 

= rf/j + r i+ il i+ i (6) 

where the inequality follows from the fact that g(j>) is concave in p. Therefore, the new policy 
departs more bits by time Y^=a h- Keeping the remaining transmission rates the same, the new 
policy will finish the entire transmission over a shorter duration. Thus, the original policy could 
not be optimal. Therefore, the optimal policy must have monotonically increasing powers (and 
rates). ■ 

Lemma 2 The transmission power/rate remains constant between energy harvests, i.e., the 
power/rate only potentially changes when new energy arrives. 

Proof: Assume that the transmitter changes its transmission rate between two energy harvesting 
instances Sj, s i+ i. Denote the rates as r n , r n+1 , and the instant when the rate changes as as 
shown in Fig. [5J Now, consider the duration [sj,s i+1 ). The total energy consumed during the 
duration is p n {s' i - Si) +p n+1 (s i+1 - s-). Let 

/ = Pnjs'j ~ Sj) +p n+1 (Sj +1 - 
Si+1 — s i 
r , = f Pn(Sj ~ Si) + p n+1 (Sj +1 - 
V s i+l ~ s i 

Now let us use r' as the new transmission rate over [s^ s i+ i), and keep the rest of the rates the 
same. It is easy to check that the energy constraints are satisfied under this new policy, thus this 
new policy is feasible. On the other hand, the total number of bits departed over this duration 
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Fig. 5. The rate must remain constant between energy harvests. 



under this new policy is 

u \ f PrJA ~ Sj) +Pn+l(s i+1 - S$ \ 

r (s i+ i - Si) = g (s i+ i - Si) 

V - Sj J 

> (g(Pn)^^- + giPn+if^ 1 ^) (s i+1 - Si ) 

\ Sj+i — s i Si+i — Si / 

= r n (s- - Si) + r n +i(si+i - s'i) (9) 

where the inequality follows from the fact that g(p) is concave in p. Therefore, the total number 
of bits departed under the new policy is larger than that under the original policy. If we keep 
all of the remaining rates the same, the transmission will be completed at an earlier time. This 
conflicts with the optimality of the original policy. ■ 

Lemma 3 Whenever the transmission rate changes, the energy consumed up to that instant 
equals the energy harvested up to that instant. 

Proof: From Lemma [2} we know that the transmission rate can change only at certain energy 
harvesting instances. Assume that the transmission rate changes at Sj, however, the energy 
consumed by s i5 which is denoted by E(s,i), is less than Y^o^j- ^ e denote the energy gap 
by A. Let us denote the rates before and after Si by r n , r n+1 . Now, we can always have two 
small amounts of perturbations S n , 5 n+ i on the corresponding transmit powers, such that 

Pn=Pn + K (10) 
Pn+1 = Pn+1 - <Wl (11) 

(12) 



7 



We also make sure that 5 n and 5 n+ i are small enough such that 5 n l n < A, and p' n < p' n+1 . If 
we keep the transmission rates over the rest of the duration the same, under the new transmission 
policy, the energy allocation will still be feasible. The total number of bits departed over the 
duration (Y^i k, E2i is 

g(p'n) l n + g(Pn+l)ln+l > g(Pn)h + g(Pn+l%+l (13) 

where the inequality follows from the concavity of g(p) in p, and the fact that p n l n +p n+ il n+ i = 
p'J-n +Pn+iWi> Pn < Pn < Pn+i < Pn+i, as shown in Fig. [6J This conflicts with the optimality 
of the original allocation. ■ 




Pn P'n Pn+1 Pn+1 P 



Fig. 6. g(p) is concave in p. 

We are now ready to characterize the optimum transmission policy. In order to simplify the 
expressions, we let i = 0, and let s m+ i = T if the transmission completion time T lies between 

s m and s m +i. 

Based on Lemmas \T\ [2] and [3j we can characterize the optimal policy in the following way. 
For given energy arrivals, we plot the total amount of harvested energy as a function of t, which 
is a staircase curve as shown in Fig. |7J The total energy consumed up to time t can also be 
represented as a continuous curve, as shown in Fig.fTJ In order to satisfy the feasibility constraints 
on the energy, energy consumption curve must lie below the energy harvesting curve at all times. 
Based on Lemma [2} we know that the optimal energy consumption curve must be linear between 
any two consecutive energy harvesting instants, and the slope of the segment corresponds to the 
transmit power level during that segment. Lemma [3] implies that whenever the slope changes, 
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Fig. 7. An interpretation of transmission policies satisfying Lemmas 1, 2 and 3. 



the energy consumption curve must touch the energy harvesting curve at that energy harvesting 
instant. Therefore, the first linear segment of the energy consumption curve must be one of the 
lines connecting the origin and any corner point on the energy harvesting curve before t = T. 
Because of the monotonicity property of the power given in Lemma [TJ among those lines, we 
should always pick the one with the minimal slope, as shown in Fig. [7J Otherwise, either the 
feasibility constraints on the energy will not be satisfied, or the monotonicity property given in 
Lemma 1 will be violated. For example, if we choose the line ending at the corner point at 
s 3 , this will violate the feasibility constraint, as the energy consumption curve will surpass the 
energy arrival curve. On the other hand, if we choose the line ending at the corner point at s±, 
then the monotonicity property in Lemma \T\ will be violated, because in that case, the slope of 
the following segment would be smaller. These properties must hold similarly for p 2 , pz, ■ ■ ., Pn- 
We also observe that, for given T, the optimal transmission policy is the tightest string below 
the energy harvesting curve connecting the origin and the total harvested energy by time T. This 
is similar to the structure in 0. 

We state the structure of the optimal policy formally in the following theorem. 

Theorem 1 For a given B , consider a transmission policy with power vector p = [pi, p 2 , . . . , Pn] 
and corresponding duration vector 1 = [Zi, l 2 , ■ ■ ■ , In]- This policy is optimal if and only if it has 
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the following structure: 



N 



J29(Pn)ln = B (14) 



n=l 



and for n = 1, 2, . . . , N, 



arg min { — 3 %n 1 } (15) 

Ein— 1 771 

= = (16) 

= s i n s i n -l (17) 

where i n is the index of the energy arrival epoch when the power p n switches to p n+ i, i.e., at 
t = s in , p n switches to p n+1 . 

The proof of this theorem is given in Appendix |Al 

Therefore, we conclude that if the overall transmission duration T is known, then the optimal 
transmission policy is known via Theorem \T\ In particular, optimal transmission policy is the 
one that yields the tightest piecewise linear energy consumption curve that lies under the energy 
harvesting cure at all times and touches the energy harvesting curve at t = T. On the other hand, 
the overall transmission time T is what we want to minimize, and we do not know its optimal 
value up front. Consequently, we do not know up front which energy harvests will be utilized. 
For example, if the number of bits is small, and E is large, then, we can empty the data queue 
before the arrival of E\, thus, the rest of the energy arrivals are not necessary. Therefore, as a 
first step, we first obtain a good lower bound on the optimal transmission duration. 

We first illustrate our algorithm through an example in Fig. [8j We first compute the minimal 
energy required to finish the transmission before s\. We denote it as A\, and it equals 

Ai = W^U (18) 



si 

Then, we compare it with E . If A x < E , it implies that we can complete the transmission before 
the arrival of the first energy harvest, thus E\ is not necessary for the transmission. We allocate 
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Fig. 8. An illustration of the algorithm. 



E evenly to B bits, and the duration A x is the minimum transmission duration. On the other 
hand, if A\ > E , which is the case in the example, the final transmission completion time should 
be longer than s±. Thus, we move on and compute A 2 , A 3 , A±, and find that A 2 > ^2 i=0 Ez, 
A 3 > ^2 i=0 Ei and A 4 < Y^=oEi- This means that the total transmission completion time will 
be larger than S3 and energies E , . . ., E 3 will surely be utilized. Then, we allocate Y^H=o^i 
evenly to B bits and obtain a constant transmission power pi, which is the dotted line in the 
figure. The corresponding transmission duration is T\. Based on our allocation, we know that the 
final optimal transmission duration T must be greater than 7\. This is because, this allocation 
assumes that all E , . . ., E s are available at the beginning, i.e., at time t — 0, which, in fact, are 
not. Therefore, the actual transmission time will only be larger. Thus, T\ is a lower bound for 
T. 

Next, we need to check the feasibility of p±. Observing the figure, we find that pi is not 
feasible since it is above the staircase energy harvesting curve for some duration. Therefore, we 
connect all the corner points on the staircase curve before t — T\ with the origin, and find the 
line with the minimum slope among those lines. This corresponds to the red solid line in the 
figure. Then, we update p\ with the slope pi, and the duration for pi is l x = s iL . We repeat this 
procedure at t = and obtain p 2 , and continue the procedure until all of the bits are finished. 
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We state our algorithm for the general scenario as follows: First, we compute the amounts of 
energy required to finish the entire transmission before s±, ■ ■ ., sk, respectively, at a constant 
rate. We denote these as A{. 

A i = g- 1 ( I ^J Si , i = l,2,...,K (19) 

Then, we compare A{ with X]}=o an( ^ ^ n< ^ me smallest i such that A\ < X]}=o Ej- We denote 
this i as «!. If no such i\ exists, we let %\ — K + 1. 

Now, we assume that we can use Y^=o Ej t0 transmit all B bits at a constant rate. We 
allocate the energy evenly to these bits, and the overall transmission time 7\ is the solution of 



and the corresponding constant transmit power is 



Ti = B (20) 



E'l-l rp 

Pi = —h^ (21) 

1 i 



Next, we compare pi with —2— - — for every i < i\. If pi is smaller than every term, then, 



maintaining p x is feasible, and the optimal policy is to transmit at a constant transmission rate 
g(pi) with duration T\, which gives the smallest possible transmission completion time, = . 
Otherwise, maintaining p\ is infeasible under the given energy arrival realization. Thus, we update 

— f (22) 

Si 

Eii-i p 

Pi = — (23) 

i.e., over the duration [0, SjJ, we choose to transmit with power pi to make sure that the energy 
consumption is feasible. Then, at time t — s^, the total number of bits departed is g{pi)si 1 , 
and the remaining number of bits is B — gipijs^. Subsequently, with initial number of bits 
Bo — g(pi)Si 1 , we start from s^, and get another lower bound on the overall transmission duration 
T 2 , and repeat the procedure above. Through this procedure, we obtain p 2 ,ps, . . . ,Pn, and the 
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corresponding i 2 , z - 3, . . . , ijv, until we finish transmitting all of the bits. 

Based on our allocation algorithm, we know that p x is optimum up to time Ti, since it 
corresponds to the minimal slope line passing through the origin and any corner point before 
t = Ti. However, the algorithm also implies that the final transmission duration T will be larger 
than Ti. The question then is, whether p\ is still the minimum slope line up to time T. If we 
can prove that pi is lower than the slopes of the lines passing through the origin and any corner 
point in [Ti, T], then, using Theorem [H we will claim that p 1 is the optimal transmission policy, 
not only between [0,Ti], but also between [0, T]. 

The fact that this will be the case can be illustrated through the example in Fig. [8] We note 
that, clearly, Ti is a lower bound on the eventual T. If we keep transmitting at power p 1 , if no 
additional energy arrives, the energy harvested up until , i.e., Yl % i=Q Ei, will be depleted by 
time T[. We will next prove that T[ is an upper bound on T. Because of the concavity of the 
function g(p) in p, we can prove that under this policy, the number of bits departed up to time 
T[ is greater than B . Therefore, since potentially additional energy will arrive, T[ provides an 
upper bound. Thus, we know that the optimal T lies between Ti and T[. We next note that if 
we connect the origin with any corner point of the staircase curve between Ti and T[, the slope 
of the resulting line will be larger than p\, thus, p\ will be the smallest slope not only up to 
time Ti, which is a lower bound, but also up to time T[, which is an upper bound. This proves 
that while we do not know the optimal T, if we run the algorithm with respect to the lower 
bound on T, i.e., Ti, it will still yield an optimal policy, in that the resulting policy will satisfy 
Theorem [TJ 

We prove the optimality of the algorithm formally in the following theorem. 

Theorem 2 The allocation procedure described above gives the optimal transmission policy. 
The proof of this theorem is given in Appendix [Bj 

III. Scenario II: Packets Arrive during Transmissions 

In this section, we consider the situation where packets arrive during transmissions. We assume 
that there is an E amount of energy available at time t = 0, and at times s\, s 2 , ■ ■ ., s K , energy 
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is harvested in amounts E\, E 2 , ■ ■ ■ , E K , respectively, as in the previous section. We also assume 
that at t = 0, we have B bits available, and at times t±, t 2 , . . ., tM, bits arrive in amounts B\, 
B 2 , . . . , Bm, respectively. This system model is shown in Fig. [2] Our objective is again to 
minimize the transmission completion time, T, which again is the time by which the last bit is 
delivered to the destination. 

Let us denote the sequence of transmission powers by p±, p 2 , ■ ■ ., Pn, an d the corresponding 
transmission durations by l\, l 2 , . . ., In. Then, the optimization problem becomes: 

min T 
p,i 

s.t. E(t) < ^ E h < t < T 

i:Si<t 

Bit) <^2Bi, < t < T 

i:ti<t 
M 

B{T) = Y,Bi (24) 

i=0 

where E(t), B(t) are defined in (0Q) and ©. We again determine the properties of the optimal 
transmission policy in the following three lemmas. 

Lemma 4 Under the optimal solution, the transmission rates increase in time, i.e., T\ < r 2 < 
■■■ <r N . 

Proof: First, note that since the relationship between power and rate, r = g(p), is monotone, 
stating that the rates increase monotonically is equivalent to stating that the powers increase 
monotonically. We follow steps similar to those in the proof of Lemma Q] to prove this lemma. 
Assume that the rates do not increase monotonically, i.e., that we can find two rates such that 
Ti > r i+ i, with duration U, respectively. If % + 1 ^ N, then, let 

U = r i+1 = (25) 

H T H+l 
_1 / r ih + r i+lh+l 



ft=ft» = » 1 k + k+l ) < 26 > 



14 



Since r { > r\ = r' i+1 > r i+1 , Pi > p[ = p' i+1 > Pi+i, it is easy to verify that the new policy 
is feasible up to the end of U + \, from both the data and energy arrival points of view. On 
the other hand, based on the convexity of g^ 1 , the energy spent over the duration U + is 
smaller than pili +p i+ il i+1 . If we allocate the saved energy over to the last transmission duration, 
without conflicting any energy or data constraints, the transmission will be completed in a shorter 
duration. If i + 1 = N, then, we let 



Then, from ©, under the new policy, the last bit will depart before the end of The energy 
and data arrival constraints are satisfied over the whole transmission duration. Consequently, the 
original policy could not be optimal. Therefore, the optimal policy must have monotonically 
increasing rates (and powers). ■ 

Lemma 5 The transmission power/rate remains constant between two event epoches, i.e., the 
rate only potentially changes when new energy is harvested or a new packet arrives. 

Proof: This lemma can be proved through a procedure similar to that in Lemma [2l If power/rate 
is not constant between two event epoches, then, by equalizing the rate over the duration while 
keeping the total departures fixed, we can save energy. Allocating this saved energy to the last 
transmission duration, we can shorten the whole transmission duration. Thus, if power/rate is 
not constant between two event epoches, the policy cannot be optimal. ■ 

Lemma 6 If the transmission rate changes at an energy harvesting epoch, then the energy 
consumed up to that epoch equals the energy harvested up to that epoch; if the transmission 
rate changes at a packet arrival epoch, then, the number of packets departed up to that epoch 
equals the number of packets arrived up to that epoch; if the event epoch has both energy and 
data arrivals at the same time, then, one of the causality constraints must be met with equality. 




Pih + Pi+ih+i 

U + h+1 



(27) 




(28) 
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Proof: This lemma can be proved through contradiction using techniques similar to those used 
in the proof of Lemma [3l When the transmission rate changes at an energy harvesting epoch, if 
the energy consumed up to that time is not equal to the total amount harvested, then, without 
conflicting the energy causality constraint, we can always increase the rate before that epoch 
a little and decrease the rate after that epoch a little while keeping the total departures fixed. 
This policy would save some energy which can be used to shorten the transmission durations 
afterwards. Thus, the energy constraint at that epoch must be satisfied as an equality. The 
remaining situations can be proved similarly. ■ 

Based on Lemmas |4] [5] and [6l we can identify the structure of the unique optimal transmission 
policy as stated in the following theorem. In order to simplify the notation, we define U{ to be 
the time epoch when the 2th arrival (energy or data) happens, i.e., 

u x = min{si,ti} (29) 
u 2 = min{si,tj : Sj > U\,tj > Ui} (30) 

and so on, until the last arrival epoch. 

Theorem 3 For a given energy harvesting and packet arrival profile, the transmission policy 
with a transmission rate vector r = [r±, r 2 , . . . , r^] and the corresponding duration vector 
1 = [h,h, ■ ■ ■ ,In] I s optimal, if and only if it has the following structure: 

N M 

J2rih = J2 B * (3!) 

i=l i=0 

9 , 1 > (32) 

y Ui j Ui 

Let i\ he the index of u associated with r±. Then, with updated amount of bits and energy 
remaining in the system at t = u^, r 2 is the smallest feasible rate starting from Ui v and so on. 

The proof of this theorem is given in Appendix O 

For a given optimal transmission duration, T, the optimal policy which has the structure in 
Theorem [3] is unique. However, since we do not know the exact transmission duration up front, 
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we obtain a lower bound on T first, as in the previous section. In this case also, we develop a 
similar procedure to find the optimal transmission policy. The basic idea is to keep the transmit 
power/rate as constant as possible throughout the entire transmission duration. Because of the 
additional casuality constraints due to data arrivals, we need to consider both the average data 
arrival rate as well as the average power the system can support for feasibility. 

If sk < tM, i-e., bits have arrived after the last energy harvest, then, all of the harvested 
energy will be used. First, we assume that we can use these energies to maintain a constant rate, 
and the transmission duration will be the solution of 



9 



2^=0^' 



T 



r = 5> 03) 

3=0 



Then, we check whether this constant power/rate is feasible. We check the availability of the 
energy, as well as the available number of bits. Let 

2^=0 L 3 I 2^=0 h i nA . 

— } , Pi = — (34) 

Ui Ui 

tib = arg mm < — > , n = — (35) 

Ui<T Ui Ui 



We compare min(pi, g~ l {ri)) with 3 = ° J . If the former is greater than the latter, then the 
constant transmit power 3= ° 3 is feasible. Thus, we achieve the minimum possible transmission 
completion time T. Otherwise, constant-power transmission is not feasible. We choose the 
transmit power to be the smaller of pi and g^ 1 ^), and the duration to be the one associated 
with the smaller transmit power. We repeat this procedure until all of the bits are transmitted. 

If sk > thit then, as in the first scenario where packets have arrived and are ready before 
the transmission starts, some of the harvested energy may not be utilized to transmit the bits. In 
this case also, we need to get a lower bound for the final transmission completion time. Let u n 
be the energy harvesting epoch right after Then, starting from u n , we compute the energy 
required to transmit Ylj=o Bj bits at a constant rate by Ui, u n <Ui < uk+m, and compare them 
with the total energy harvested up to that epoch, i.e., J2j. s . <Ui Ej. We identify the smallest i 
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such that the required energy is smaller than the total harvested energy, and denote it by i x . If 
no such i\ exists, we let i\ — M + K + 1. 

Now, we assume that we can use Yj- Sj <u~. Ej t0 transmit YjLo Bj bits at a constant rate. We 
allocate the energy evenly to these bits, and the overall transmission time 7\ is the solution of 



9 



V E\ M 



Ti 

and the corresponding constant transmit power is 



3=0 



Pi = J T (37) 

Next, we compare p\ with ^ J ' Sj< "' 3 and g^ 1 I ^ j *j < "' 3 [ f or every i < %\. If pi is smaller 
than all of these terms, then, maintaining p\ is feasible from both energy and data arrival points 
of view. The optimal policy is to keep a constant transmission rate at g(pi) with duration T x , 
which yields the smallest possible transmission completion time, i\ = i±. Otherwise, maintaining 
Pi is not feasible under the given energy and data arrival realizations. This infeasibility is due to 
the causality constraints on either the energy or the data arrival, or both. Next, we identify the 
tightest constraint, and update the transmit power to be the power associated with that constraint. 
We repeat this procedure until all of the bits are delivered. 

Theorem 4 The transmission policy obtained through the algorithm described above is optimal. 
The proof of this theorem is given in Appendix [Ql 

IV. Simulation Results 

We consider a band-limited additive white Gaussian noise channel, with bandwidth W = 
1MHz and the noise power spectral density N = 10~ 19 W/Hz. We assume that the distance 
between the transmitter and the receiver is 1km, and the path loss is about HOdB. Then, we 
have g(p) = W\og 2 ^1 + j^fJ = log 2 (l + Y(^)Mbps. It is easy to verify that this function 
has the properties assumed at the beginning of Section lU For the energy harvesting process, 
we assume that at times t = [0, 2, 5, 6, 8, 9, ll]s, we have energy harvested with amounts E = 
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[10, 5, 10, 5, 10, 10, 10]mJ, as shown in Fig. |9] We assume that at t = 0, we have 5.44Mbits 
to transmit. We choose the numbers in such a way that the solution is expressable in simple 
numbers, and can be potted conveniently. Then, using our algorithm, we obtain the optimal 
transmission policy, which is shown in Fig. [9j We note that the powers change only potentially 
at instances when energy arrives (Lemma [2]); when the power changes, energy consumed up to 
that point equals energy harvested (Lemma [3]); and power sequence is monotonically increasing 
(Lemma [Q). We also note that, for this case, the active transmission is completed by time 
T = 9.5s, and the last energy harvest at time t = lis is not used. 

Next, we consider the scenario where data packets arrive during the transmissions. We consider 
a smaller time scale, where each unit consists of 10ms. We assume that at times t = [0, 5, 6, 8, 9], 
energies arrive with amounts E = [5,5,5,5,5] x 10" 2 mJ, while at times t = [0,4, 10], packets 
arrive with equal size lOkbits, as shown in Fig. \\0\ We observe that the transmitter changes its 
transmission power during the transmissions. The first change happens at t = 5 when energy 
arrives, and the energy constraint at that instant is satisfied with equality, while the second change 
happens at t — 10 when new bits arrive, and the traffic constraint at that time is satisfied with 
equality. 
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Fig. 9. Optimal transmit powers p — [3, 5, 10, 20]mW, with durations 1 = [5, 3, 1, 0.5]s. 



V. Conclusions 

In this paper, we investigated the transmission completion time minimization problem in an 
energy harvesting communication system. We considered two different scenarios, where in the 
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Fig. 10. Optimal transmit powers p = [1, 2, 10]mW, with durations 1 = [5, 5, 1] x 10 



first scenario, we assume that packets have already arrived and are ready to be transmitted 
at the transmitter before the transmission starts, and in the second scenario, we assume that 
packets may arrive during the transmissions. We first analyzed the structural properties of the 
optimal transmission policy, and then developed an algorithm to obtain a globally optimal off-line 
scheduling policy, in each scenario. 

Appendix 

A. The Proof of Theorem Q] 

We will prove the necessariness and the sufficiency of the stated structure separately. First, 
we prove that the optimal policy must have the structure given above. We prove this through 
contradiction. Assume that the optimal policy, which satisfies Lemmas [Q [2] and [3l does not have 
the structure given above. Specifically, assume that the optimal policy over the duration [0, Sj n _ 1 ) 
is the same as the policy described in Theorem 1, however, the transmit power right after s in _ 1 , 
which is p n , is not the smallest average power possible starting from Sj n _ 1 , i.e., we can find 
another < s iN , such that 

Pn > = V (38) 

s i' S i n -i 

Based on Lemma [3l the energy consumed up to s in __ x is equal to YlT=o~ Ej, i.e., there is no 
energy remaining at t = s~ . 
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We consider two possible cases here. The first case is that sy < s in , as shown in Fig. 1 1 1(a) 
Under the optimal policy, the energy required to maintain a transmit power p n over the duration 
[ S i„-D s i') is Pn{ s i' ~ s i n -i)- Based on (1381) . this is greater than the total amount of energy 
harvested during [si n _ i: Sy), which is Y^jZi^Ej. Therefore, this energy allocation under this 
policy is infeasible. 

On the other hand, if sy > s in , as shown in Fig. |1 l(b)[ then the total amount of energy 
harvested over [s in , sy) is Yl]Zl n Ej. From (1381) . we know 

j=in-l 3 - ^j=in-l i ' _ 2-^ 



Pn 



> 



> 



E< 

'J=ln 3 



Si' 



Si' 



(39) 



Thus, under any feasible policy, there must exist a duration I C [s in , sy), such that the transmit 
power over this duration is less than p n . This contradicts with Lemma [B Therefore, this policy 
cannot be optimal. 
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Fig. 11. Two different cases in the proof of Theorem 1. 
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Next, we prove that if a policy with power vector p and duration vector 1 has the structure 
given above, then, it must be optimal. We prove this through contradiction. We assume that 
there exists another policy with power vector p' and duration vector 1', and the transmission 
completion time T' under this policy is smaller. 

We assume both of the policies are the same over the duration [0, Sj n _J, however, the transmit 
policies right after Sj n _ 1? which are p n and p' n , with durations l n and l' n , respectively, are different. 
Based on the assumption, we must have p n < p' n . 

If l n < l' n , from Lemma [3j we know that the total energy available over s in ) is equal to 

p n l n . Since p n < p' n , p' n is infeasible over [s^^, s in ). Thus, policy p' cannot be optimal. Then, 
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we consider the case when l n > l' n . If V > s in , then, the total energy spent over [s in _ 1 ,Sj n ) 
under p' is greater than p n l n , since p' n > p n , and p' n+1 > p' n based on Lemma [Q If V < s in , 
since the power-rate function g is concave, the total number of bits departed over [si n _ i; Sj n ) 
under p is greater than that under p'. Thus, policy p' cannot depart B bits over T', and it 
cannot be optimal. 

In summary, a policy is optimal if and only if it has the structure given above, completing 
the proof. 

B. The Proof of Theorem [2] 

Let T be the final transmission duration given by the allocation procedure. Then, we have 
B(T) = Bq. In order to prove that the allocation is optimal, we need to show that the final 
transmission policy has the structure given in Theorem \T\ We first prove that p x satisfies ( fT6l) . 
Then, we can similarly prove that p 2 , p^, ... satisfy (IT6T ). 

We know that if T = T\, then it is the minimum possible transmission completion time. We 
know that this transmit policy will satisfy the structural properties in Theorem \T\ Otherwise, the 
final optimal transmission time T is greater than T\, and more harvested energy may need to be 
utilized to transmit the remaining bits. From the allocation procedure, we know that 

pi < j=0 \ Vz < h (40) 



In order to prove that p\ satisfies (|T61) . we need to show that 

Pi < J ~° , Vz : sj < Si < T (41) 

Si 

If we keep transmitting with power p 1? then at T[ = — -, the total number of bits departed 
will be 

9(pi)T{ > g (^^j Ti = B (42) 
where the inequality follows from the assumption that g(p)/p decreases in p. Then, (|40l) guar- 
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antees that this is a feasible policy. Thus, under the optimal policy, the transmission duration T 
will be upper bounded by T[, i.e., 

^1-1 



T < 



which implies 



Pi 



(43) 



Pi 



< 



(44) 



If T < si , as shown in Fig. |12(a)[ no future harvested energy is utilized for the transmissions. 
Then, (|44)) guarantees that (HT)) is satisfied. 



If T > s% , as shown in Fig. |12(b)[ additional energy harvested after should be utilized to 
transmit the data. We next prove that (|4TT) still holds through contradiction. Assume that there 
exists i' with sj < Si> < T, such that (l4~Tj) is not satisfied, i.e., 



Pi > = p 



(45) 



Then, 



Pi 



< Si 



(46) 



Combining this with (1431) . we have T < Sj/, which contradicts with the assumption that < T. 
Thus, (l4~TT) holds, pi satisfies the requirement of the optimal structure in (l40l) . 

We can then prove using similar arguments that p 2 , pz, ■ ■ ■ also satisfy the properties of the 
optimal solution. Based on Lemma Q3 this procedure gives us the unique optimal policy. 
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Fig. 12. Two different cases in the proof of Theorem 2. 
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C. The Proof of Theorem \3\ 

First, we prove that for the optimal transmission policy, r\ must satisfy (l32l) . We prove this 
through contradiction. If r\ does not satisfy (1321) . then, we can always find another u^, such that 

f fy. „ eA y ^ BA 

ri>min fl J , 3 > (47) 

I y J u v 

First, we assume that g ( ^ J ' Sj< "' / — - J < ^ J ' tj<v — -. Then, if < u^, clearly n is not feasible 



over the duration [0, iv), because of the energy constraint. If u v > u ix , then, the transmitter 
cannot maintain a transmission rate that is always greater than T\ over Wj/), from the energy 
point of view. This contradicts with Lemma |H Similarly, if g (—L^-H. — I J > ^ J,fj< "'' — the 



"bottleneck" is the data constraint. We can prove that t\ is not feasible. Thus, T\ must be the 
smallest feasible rate starting from t = 0, as in (|32|) . We can also prove that r 2 , r 3 , ... must 
have the same structure, in the same way. Next, we can prove that any policy has the structure 
described above is optimal. We can prove this through contradiction. Assume that there exists 
another policy with a shorter transmission completion time. Based on Lemmas @] and [6l we can 
prove that this policy could not be feasible. 

D. The Proof of Theorem 

First we prove that r\ obtained through this procedure satisfies (l32l) . If T — T\, i.e., the constant 
rate is achievable throughout the transmission, then it is the shortest transmission duration we 
can get, thus, it is optimal. If T ^ T\, from the procedure, we have 

n < mm < g , 3 - } (48) 

l<i<h \ Ui Ui 



We need to prove that 

T , eA y.^ 



n < min \g ( J ^ ' J , J ^ 1 ) for u 5l < u t < T. (49) 
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Considering the policy with a constant power p 1 = (jf~ 1 (r 1 ), then, at T[ = — -, the total 



Pi 

number of bits departed will be 



9(Pi)T[>g[^^)T 1 = J2B 3 (50) 



3=0 



yfis 1 b 

while at T" = 3 , the total energy required will be 

>»1 — 1 771 V^»l — 1 D 81—1 



1 



where the inequality follows from the assumption that g(p)/p decreases in p. Therefore, main- 
taining a transmission rate r± until the last bit departs the system is feasible from both the energy 
and data arrival points of view. Thus, under the optimal policy, the transmission duration T will 
be upper bounded by T[ and T", i.e., 



T < Z " j=0 J ' , T < Z " i=0 j (52) 
Pi ri 



which implies 



Eii-i p y^ii— l R 

j=0 ^ ^j=0 D 3 , c ~. 

Pi S — ^ , n < — ^ (53) 

If no future harvested energy is utilized for the transmissions, (1531) guarantees that (l32l) is satisfied. 

If T > , additional energy harvested after u- i± should be utilized to transmit the data. We next 
prove that ( |49| ) still holds through contradiction. Assume that there exists i' with u~ h < Uy < T, 
such that (l49l) is not satisfied, i.e., 



Ei'-l rp ■sr^i'-l R 

3=0 i ^ ^3=0 3 /C/1 s 

Pi > — or n > — (54) 



Then, we have 



— < or — < Ui> (55) 

Pi n 
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Combining this with (1521) , we have T < w^, which contradicts with the assumption that uy < T. 
Thus, ( |49l ) holds, r\ satisfies the requirement of the optimal structure in (l32l) . We can then prove 
using a similar argument that r 2 , r 3 , . . . also satisfy the structure of the optimal solution. Based 
on Theorem [3l this procedure gives us the unique optimal transmission policy. 
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